Get our free extension to see links to code for papers anywhere online!

Add to Chrome

Add to Firefox

Get Pro 💎 Log In/Sign Up 🚀

CatalyzeX

✏️ To add code publicly for 'REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback', sign in to proceed instantly

Continue with email

Continue with Google

Continue with Github

Continue with LinkedIn

Continue with Facebook

Continue with Twitter

© 2024 CatalyzeX

Privacy Policy Bugs? Contact Us

Follow us